Search results for "LCP"

Detecting mutations by eBWT

2018

In this paper we develop a theory describing how the extended Burrows-Wheeler Transform (eBWT) of a collection of DNA fragments tends to cluster together the copies of nucleotides sequenced from a genome G. Our theory accurately predicts how many copies of any nucleotide are expected inside each such cluster, and shows how an elegant and precise LCP-array-based procedure can locate these clusters in the eBWT. Our findings are very general and can be applied to a wide range of different problems. Here we consider the case of alignment-free and reference-free SNP discovery in multiple collections of reads. We note that, in accordance with our theoretical results, SNPs are clustered in th…
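
The abstract does not give the prediction formula, so the following Python sketch is only a back-of-the-envelope model, not the paper's derivation. It assumes error-free reads of equal length whose start positions are uniform on a circular genome, and it counts a read as contributing a copy of a position to a cluster when the read contains that position followed by at least `k` further bases. All function and parameter names are hypothetical.

```python
from fractions import Fraction


def expected_copies(num_reads, read_len, genome_len, k):
    """Expected number of reads that contain a fixed genome position
    followed by at least k further bases (assumed model, see lead-in)."""
    if not 0 <= k < read_len <= genome_len:
        raise ValueError("need 0 <= k < read_len <= genome_len")
    # The position may sit at offsets 0 .. read_len - k - 1 inside a read,
    # so read_len - k of the genome_len possible start positions qualify.
    return Fraction(num_reads * (read_len - k), genome_len)


def count_valid_starts(read_len, genome_len, k, position=0):
    """Brute-force check: count start positions on the circular genome
    whose read covers `position` with at least k bases after it."""
    count = 0
    for start in range(genome_len):
        offset = (position - start) % genome_len  # index of position in the read
        if offset + k < read_len:
            count += 1
    return count


if __name__ == "__main__":
    # 1000 reads of length 100 over a genome of length 5000, context length 16.
    print(float(expected_copies(1000, 100, 5000, 16)))
    print(count_valid_starts(100, 5000, 16))
```

Under these assumptions the expectation is the coverage `num_reads * read_len / genome_len` scaled by `(read_len - k) / read_len`, so longer required contexts yield smaller clusters.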

0301 basic medicine; FOS: Computer and information sciences; 000 Computer science, knowledge, general works; BWT; LCP Array; SNPs; Reference-free; Assembly-free; Settore INF/01 - Informatica; [SDV]Life Sciences [q-bio]; SNP; 03 medical and health sciences; 030104 developmental biology; Computer Science; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); [INFO]Computer Science [cs]; Software

Docosahexaenoic acid, but not eicosapentaenoic acid, lowers ambulatory blood pressure and shortens interval QT in spontaneously hypertensive rats in …

2009

This study was designed to evaluate the effects of individual dietary long-chain n-3 polyunsaturated fatty acids (LCPUFA) on hypertension and consequent cardiac disorders in spontaneously hypertensive rats (SHR) as compared to Wistar-Kyoto rats (WKY). For 2 months, rats were fed an eicosapentaenoic acid (EPA)- or docosahexaenoic acid (DHA)-rich diet (240 mg/day) or an n-3 PUFA-free diet. Male SHR (n=6), implanted with cardiovascular telemetry devices, were housed in individual cages for continuous measurement of cardiovascular parameters (blood pressure (BP) and heart rate (HR)) during either activity or rest periods; ECGs were recorded during the quiet period. The n-6 PU…

Male; ecg; Clinical Biochemistry; Blood Pressure; 030204 cardiovascular system & hematology; Essential hypertension; Rats Inbred WKY; Electrocardiography; chemistry.chemical_compound; 0302 clinical medicine; Rats Inbred SHR; membrane; 2. Zero hunger; chemistry.chemical_classification; 0303 health sciences; [SDV.BA]Life Sciences [q-bio]/Animal biology; telemetry; Eicosapentaenoic acid; 3. Good health; shr; Docosahexaenoic acid; Hypertension; cardiovascular system; Arachidonic acid; lipids (amino acids peptides and proteins); Polyunsaturated fatty acid; dietary n-3 lcpufa; medicine.medical_specialty; Cardiotonic Agents; Docosahexaenoic Acids; Linoleic acid; heart; Biology; 03 medical and health sciences; Fatty Acids Omega-6; Internal medicine; medicine; Animals; Unsaturated fatty acid; phospholipid; 030304 developmental biology; Myocardium; Cell Biology; medicine.disease; Rats; blood pressure monitoring; Endocrinology; chemistry; Endothelium Vascular; [SDV.AEN]Life Sciences [q-bio]/Food and Nutrition

Lightweight LCP construction for next-generation sequencing datasets

2012

The advent of "next-generation" DNA sequencing (NGS) technologies has meant that collections of hundreds of millions of DNA sequences are now commonplace in bioinformatics. Knowing the longest common prefix array (LCP) of such a collection would facilitate the rapid computation of maximal exact matches, shortest unique substrings and shortest absent words. CPU-efficient algorithms for computing the LCP of a string have been described in the literature, but they require large data structures to be held in RAM, which makes such methods infeasible for NGS datasets. In this paper we propose the first lightweight method that simultaneously computes, via sequential scans, the LCP and B…
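
To make the objects named in this abstract concrete, here is a minimal Python sketch that computes the suffix array, the LCP array and the BWT of a single short string. It is a naive in-memory illustration, exactly the kind of approach the abstract says does not scale to NGS collections, and it is not the lightweight sequential-scan method the paper proposes. The function names are hypothetical, and the text is assumed to end with a unique terminator `$` that sorts before every other character.

```python
def suffix_array(text):
    """Start positions of all suffixes of text, in lexicographic order.
    Naive construction: sorts the suffixes themselves."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def lcp_array(text, sa):
    """lcp[i] is the length of the longest common prefix of the suffixes
    starting at sa[i - 1] and sa[i]; lcp[0] is defined as 0."""
    lcp = [0] * len(sa)
    for i in range(1, len(sa)):
        a, b = text[sa[i - 1]:], text[sa[i]:]
        n = 0
        while n < len(a) and n < len(b) and a[n] == b[n]:
            n += 1
        lcp[i] = n
    return lcp


def bwt(text, sa):
    """Character preceding each sorted suffix (wrapping around at position 0)."""
    return "".join(text[i - 1] for i in sa)


if __name__ == "__main__":
    text = "banana$"
    sa = suffix_array(text)
    print(sa)                   # [6, 5, 3, 1, 0, 4, 2]
    print(lcp_array(text, sa))  # [0, 0, 1, 3, 0, 0, 2]
    print(bwt(text, sa))        # annb$aa
```

Both arrays are indexed by suffix rank, which is why a method can emit them together while scanning suffixes in sorted order.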

Whole genome sequencing; Genomics (q-bio.GN); FOS: Computer and information sciences; Sequence; BWT; LCP; next-generation sequencing datasets; text indexes; massive datasets; Settore INF/01 - Informatica; Computer science; Computation; String (computer science); LCP array; Parallel computing; Data structure; DNA sequencing; Substring; FOS: Biological sciences; Computer Science - Data Structures and Algorithms; Quantitative Biology - Genomics; Data Structures and Algorithms (cs.DS)

SNPs detection by eBWT positional clustering

2019

Sequencing technologies keep getting cheaper and faster, putting growing pressure on data structures designed to efficiently store raw data and, possibly, to support analysis of it. In this view, there is a growing interest in alignment-free and reference-free variant-calling methods that only make use of (suitably indexed) raw read data. We develop the positional clustering theory that (i) describes how the extended Burrows–Wheeler Transform (eBWT) of a collection of reads tends to cluster together bases that cover the same genome position, (ii) predicts the size of such clusters, and (iii) exhibits an elegant and precise LCP-array-based procedure to locate such clusters in the e…
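
The abstract is truncated before the procedure is described, so the Python sketch below is only a simplified stand-in for the idea of locating clusters through the LCP array. It assumes an end-marker style transform (each read's suffixes are sorted, ties broken by read index, and the character before a whole read is `$`), treats a cluster as a maximal run of adjacent suffixes sharing at least `k_min` characters, and flags a run when two different bases each occur at least `min_cov` times in it. The names `k_min`, `min_cov` and all functions are hypothetical.

```python
from collections import Counter


def ebwt_and_lcp(reads):
    """Sort every suffix of every read; return the preceding characters
    (a BWT-like string) and the LCP values between adjacent suffixes."""
    rows = []
    for rid, read in enumerate(reads):
        for i in range(len(read) + 1):
            prev = read[i - 1] if i > 0 else "$"
            rows.append((read[i:], rid, prev))
    rows.sort(key=lambda row: (row[0], row[1]))
    chars = [row[2] for row in rows]
    lcp = [0] * len(rows)
    for j in range(1, len(rows)):
        a, b = rows[j - 1][0], rows[j][0]
        n = 0
        while n < min(len(a), len(b)) and a[n] == b[n]:
            n += 1
        lcp[j] = n
    return chars, lcp


def clusters(lcp, k_min):
    """Maximal half-open row ranges whose adjacent suffixes share >= k_min chars."""
    found, start = [], None
    for j in range(1, len(lcp)):
        if lcp[j] >= k_min:
            if start is None:
                start = j - 1
        elif start is not None:
            found.append((start, j))
            start = None
    if start is not None:
        found.append((start, len(lcp)))
    return found


def snp_candidates(reads, k_min, min_cov):
    """Clusters in which at least two different bases reach min_cov copies."""
    chars, lcp = ebwt_and_lcp(reads)
    result = []
    for lo, hi in clusters(lcp, k_min):
        counts = Counter(c for c in chars[lo:hi] if c != "$")
        frequent = {c: n for c, n in counts.items() if n >= min_cov}
        if len(frequent) >= 2:
            result.append(frequent)
    return result


if __name__ == "__main__":
    # Toy reads from two sequences differing in one base (T vs C) before "ACAGG".
    reads = ["TTACAGG", "TACAGG", "TCACAGG", "CACAGG"]
    print(snp_candidates(reads, k_min=4, min_cov=2))
```

In the toy input, the four suffixes `ACAGG` form one run, and the bases preceding them split evenly between `T` and `C`, which is the pattern this sketch reports as a candidate.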

lcsh:QH426-470; Computer science; LCP array; Reference-free; [SDV]Life Sciences [q-bio]; 0206 medical engineering; Sequencing data; SNP; Assembly-free; 02 engineering and technology; BWT; SNPs; computer.software_genre; Software; Structural Biology; Computational Theory and Mathematics; Cluster (physics); Cluster analysis; lcsh:QH301-705.5; Molecular Biology; ComputingMilieux_MISCELLANEOUS; Settore INF/01 - Informatica; business.industry; Research; Applied Mathematics; Data structure; Pipeline (software); lcsh:Genetics; lcsh:Biology (General); Data mining; [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]; business; Raw data; computer; 020602 bioinformatics